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ABSTRACT 

-We- presentla  general  procedure  for  obtaining  exact  confidence 
regions  for  the  variance  components  in  unbalanced  mixed  linear  models. 

The  procedure  utilizes,  as  pivotal  quantities,  quadratic  forms  that  may 
depend  on  the  variance  components  in  a  complicated  way  and  that  are  dis¬ 
tributed  Independently  as  chi-square  variates.  In  the  special  case  of 
balanced  classificatory  models,  the  pivotal  quantities  simplify  to  scalar 
multiples  of  sums  of  squares  from  the  usual  analysis  of  variance.  The 


procedure  can  be  easily  modified  so  as  to  obtain  an  exact  confidence 
region  for  ratios  of  variance  components  and  can  be  regarded  as  a 
generalization  of  Wald's  procedure  for  obtaining  a  confidence  interval 
for  a  single  variance  ratio.  .  _ ^ 
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1 .  INTRODUCTION 

Suppose  that  y  Is  an  n  x  1  observable  vector  that  follows  the 
general  mixed  linear  model 

(1.1)  ,  -  +  Xjb,  ♦  ...  t 

where  Is  an  n  x  m^  known  matrix  (1  >  0,...,k),  and  Is  an  x  1 

vector  of  unknown  parameters,  b^  Is  an  m^  x  1  unobservable  random  vector 

2 

whose  distribution  Is  N(0,  o^I),  that  Is,  multivariate  normal  with 

2  2 

mean  vector  0  and  variance-covariance  matrix  o^I  (1-1 , . . .  ,k-f  1) ,  and  o^  , 

2 

are  unknown  parameters.  Assume  that  bp...,b^^j  are 
2  2 

Independently  and  that  o^  >  0  (1-1,..., k)  and  ^  0*  Define 

. ‘i-  ’i  -  ■  > . "> 

y  - 

2 

We  devise  an  exact  100  (l-a)X  confidence  region  for  the  vector  a 

of  variance  components.  Essentially  the  same  approach  can  be  used  to 

obtain  an  exact  100  (l-a)X  confidence  region  for  the  vector  y  of  variance 

2 

ratios.  Confidence  regions  for  a  and  y  can  be  of  direct  Interest.  They 

can  also  be  used  to  obtain  (generally  conservative)  confidence  Intervals 
2 

for  functions  of  a  or  y  (Spjotvoll,  1972;  Khurl,  1981)  and  for  linear 
combinations  of  the  fixed  and  random  effects,  that  Is,  linear 
combinations  of  the  elements  of  the  vectors  8^,  b^ ,  ...,  b^  (Jeske, 

1985). 

2 

Let  X  (^)  represent  a  chi-square  distribution  with  f  degrees 

* 

of  freedom,  and  take  x  <  to  be  the  upper-a  point  of  this  distribution. 

a ,  t 

Under  certain  circumstances,  there  exist  k+l  quadratic  forms 
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T  T 

y  A^y . .  A^^+jy»  where  are  n  x  n  symmetric  matrices  of 

T  T 

known  constants,  such  that  (1)  y  Ajy,...,y  A^^jy  are  distributed 

T  2  2 

Independently,  (11)  y  ~  X  (f^)  for  a  positive  integer 

and  a  scalar  c^  that  are  necessarily  given  by  f^  «  rank  (A^)  and 

and  Cj^  -  k+i  ‘*ij  “  (J*l,...,k)  and 

dj^  ■  tr(Aj^)  (1*1, . . .  ,k+l) ,  and  (ill)  the  (k+1)  x  (k+1)  matrix  with 

ijth  element  d^^  is  nonsingular.  If  the  k+1  quadratic  forms  y'^A^y,..., 

T 

y  A^^^j  y  exist,  then  clearly,  for  0  <  a  <  1,  a  100  (l-a)X  confidence 

2 

region  for  the  vector  a  is  given  by  the  set  S(y)  consisting  of  those 

2 

values  of  the  vector  a  that  simultaneously  satisfy  the  k+1  inequalities 


(1.2) 


^l-ai2.fi  ^  ^  ^  ’^“ir^l  . 


where  and  represent  nonnegative  constants  such  that 

®11  ^  *12  ^  ^  . ^l^^”“il”  *12^  * 

It  is  well  known  that,  for  balanced  classif icatory  models,  the 

requisite  quadratic  forms  exist  and,  in  fact,  can  be  taken  to  be  those 

sums  of  squares  in  the  customary  analysis  of  variance  that  correspond 

to  the  random  effects  and  errors,  in  which  case  f ^  +  . . .  +  f ■  n  - 

rank(XQ)  (e.g.,  Broemeling,  1969).  Other  special  cases  for  which  such 

quadratic  forms  have  been  found  are  treated  by,  for  example,  Broemeling 

and  Bee  (1976). 
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T  T 

Unfortunately,  In  many  cases,  quadratic  forms  y  A^y,*««,y 

whose  matrices  Ap...,A^^^  are  matrices  of  known  constants  and  that 

satisfy  the  three  desired  conditions,  may.  If  they  exist  at  all,  be  hard 

to  find  and/or  may  be  such  that  £  f^  <  n  -  rankCX^),  In  %fhlch  case  the 

confidence  region  S(y)  may  be  overly  large.  In  what  follows,  we  present 

a  general  procedure  for  forming  an  exact  100(l-a)Z  confidence  region  for 
2 

the  vector  a  .  Our  approach  differs  from  the  aforementioned  approach 

In  that  we  allow  the  elements  of  the  matrix  A^  of  the  1^^  quadratic  form 
T 

y  A^y  to  be  functionally  dependent  on  the  last  k-l-t-1  variance  ratios 
2  2  2  2 

Yi  ”  “  ®k^®k+l  By  allowing  this  dependence, 

T  T 

we  are  able  to  construct  quadratic  forms  y  Ajy,...,y  satisfy 

the  three  desired  properties  and  which.  In  addition,  are  such  that 

£  f^  <•  n  -  rank(XQ).  Then,  as  In  the  aforementioned  approach,  an  exact 

100(l**a)%  confidence  region  consists  of  the  set  S(y)  of  all  values  of 
2 

the  vector  o  that  simultaneously  satisfy  the  k+l  Inequalities  (1.2). 

Our  procedure  can  be  regarded  as  an  extension  of  Wald's  (1940  and 
1947)  procedure.  Wald's  procedure,  when  extended  along  the  lines 
discussed  by  Thompson  (1955),  Spjotvoll  (1968),  Seely  and  El-Basslounl 
(1983),  and  Harvllle  and  Fenech  (1984),  covers  the  special  case  k-1. 
Hartley  and  Rao  (1967, Sec.  9)  proposed  a  general  procedure  for  obtaining 
an  exact  100  (l-a)Z  confidence  region  which,  like  ours,  can  be  viewed  as 
a  generalization  of  Wald's  procedure.  However,  as  can  easily  be 
shown.  It  produces  confidence  regions  of  a  seemingly  unappealing  form 
and.  In  fact,  can  with  high  probability  produce  confidence  regions  of 
Infinite  volume. 
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In  principle,  the  likelihood  ratio  could  be  used  to  generate  a 
confidence  region  (Hartley  and  Rao,  1967,  Sec.  8).  However,  the 
percentiles  of  Its  distribution  are  approximated  on  the  basis  of 
asymptotic  results.  The  accuracy  of  these  approximations  Is 
questionable  and  Is  not  easily  Investigated.  (What  few  asymptotic 
results  are  available  for  mixed  linear  models — see,  for  example.  Miller 
(1977) — seem  unassurlng,  and,  except  In  special  cases,  Monte  Carlo 
studies  are  computationally  unfeasible.) 

2.  PRELIMINARIES 

Define  V  -  I  +  I*'  X  X^  (1-0, . . . ,k-l)  and  -  V,  -  I,  and 

1  S-I+I'S  S  S  •  t  » 

let  V  -  -  I  +  iY„X^x3*  Under  the  assumed  model  (l.l), 

u  s- 1  s  s  s 

2  2 
y  ~  N(XqPq,  and  the  parameter  space  for  the  vector  a  is 

Qq  -  >  0,  Yj  >  0  (1-1 . k)}. 

Note,  however,  that  the  matrix  V  Is  positive  definite  for  some  values  of 
the  vector  y  that  Include  one  or  more  negative  elements. 

Subsequently,  we  take  the  model  to  be  the  generalization  of  model 

(1.1)  that  results  from  disregarding  our  original  definitions  of 

2  2  2 
as  variances  and  from  assuming  only  that  y  ~  N(XqPq, 

2 

and  that  the  parameter  space  for  the  vector  a  Is 

1* 

^  ( l-"0  j  •  •  •  jk*l ) }  j 


Q  -  {o^  :  0^  ,  >  0,  (Y,* 


—Ill  II I  llli  li  il  ■illilMliillii  1 1  Wl  ■  II 1 1  ■  I  III  IIP!  Ii  I  '111  III  Ml  li  II I  inn  II*  W  mi  111  II  'I  I  II  inilll  HI'll  <  imiiwim  i  i  ■  m  i  in  ■m  i  n  im  I  ■!!  ■  ■  ■■! 
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T 

vhere  is  the  set  of  all  values  of  »***»Y^^  such  that  is 

positive  definite. 

We  shall  have  occasion  to  refer  to  a  model,  to  be  called  Model  1, 
in  which 

S'  -  Vo  *  "l*'!  *  —  *  Vl  *  ‘l- 

where  . are  vectors  which,  like  0^,  are  composed  of  unknown 

parameters,  where  e^  is  an  n  x  1  unobservable  random  vector  with 

2 

E(ej^)  -  0  and  var(ej^)  -  where  and  hence  are 

assumed  to  be  known  (i««l , . . . ,k) .  Model  i  is  essentially  the  same  as 
model  (1.1)  except  that  the  parameter  vectors  0^,...,^^  appear  in  place 
of  the  random  vectors  bj,...,b^  and  the  parameters  are  taken 

to  be  known  Instead  of  unknown. 

For  1  -  0,1, ...,k,  define  X*  -  (X^,  X^,...,Xj).  Note  that  X*  - 

(X*_j,  X^)  (i»l,...,k).  Let  r^  -  ranhCx^),  r^  -  rank(Xj)  -  rank(Xj_^j) 

* 

(i-l,...,k),  and  ■  n  -  rank(Xj^).  Subsequently,  we  assume  that 

r^  y  0  ( 1"1 , . . . ,k+ 1 ) . 


We  write  A**  for  an  arbitrary  generalized 
that  is.  A"  is  any  matrix  satisfying  AA~A  *  A. 

*  *T  -1  *  *T  -1 

'i-i  • 


inverse  of  a  matrix  A, 
Define 

(  i*  1 , . . . ,k+ 1 ) . 


Note  that 


2  T  -I  -1  T  -1 

P  -  P  .  P  V  ■  (V  P  )  ■  V  P 

i-1  1-r  1-11  ^1  i-r  1  1-1 


and  that  ’^l-l'^I^^l-l 


<^-l<-l>\' 


1-1  1  ’ 
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Implying  that,  for  j  0,....,1-1, 


T  -1  T  -1  *  *  *T  -1  *T  -1 

^1-1  "  ^j'^l  ’  ''l  *’l-l  *  '^l 

(i“l , . . .  ,k+l) .  Note  also  that,  since  the  matrix  I“Pjj  ■  I  - 
*  Af  *  <-  * 

X^(Xj^  Xj^)  X^  Is  symmetric  and  Idempotent,  there  exists  an  n  x  r^^^^ 
matrix  F  such  that  I“Fjj  *•  FF^  and  F^F  »  I  and  that  FF^X^  “0,  implying 
that  F^X^  -  0  (1-0 . k). 


3.  QUADRATIC  FORMS 


3.1  Definition 

We  now  Introduce  the  quadratic  forms  on  which  the  proposed 

confidence  region  Is  to  be  based. 

Define  C^  -  )X^  and  )y  (1-1 . k). 

It  Is  known  that  rank  (C^^)  -  r^  and  that  there  exists  an  m^^  x  r^  matrix 

T 

of  rank  r^  such  that,  under  Model  1,  would  be  an  estimable 

T  T 

parametric  function.  It  Is  further  known  that  A^  -  for  some  matrix 

which  is  necessarily  of  rank  r^^,  and  that,  under  Model  1,  the 
minimum-variance  linear  unbiased  estimator  of  would  be 


~  ~  xT 

^1  “  ^^ir***’‘'ir^)  " 


tT 

Vl’ 


The  proposed  confidence  region  Is  based  on  the  k-fl  quadratic  forms 
Qf  -  Qi(Yj..*.,T,^;y)  -  (i-i . k). 
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T  2  2 

where  z  »  F  y.  As  shown  In  Section  3.2,  »•  •  •  * 

distributed  Independently  as  chi-square  random  variables  with  degrees 

of  freedom  r^ , . . . ,  respectively.  The  quadratic  forms 

are  Invariant  to  the  choice  of  the  matrices  Ap...,A^,  as  Is  easily 

verified. 


3.2  Joint  distribution 


Under  the  assumed  distribution  for  y,  which  Is  N(XqPq ,  we 
find  that,  for  j  <  1  -  l,...,k,  cov(qj^,qj) 


^3.1) 


-  0  -  0  -  0. 


It  can  be  shown.  In  similar  fashion,  that  cov(z,qj)  »  0  (j»l,...,k). 
Thus,  qp...,q^,z  are  distributed  Independently,  Implying  that 
Tj,...,Tj^,z  are  distributed  Independently  and  hence  that  Qj  *  •  •  •  tQjjpQjj+j 


are  distributed  Independently. 
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Further,  for  l"l,...,k,  we  find  that 


(3.2) 


Kqj)  -  ■  »• 


(3.3) 


^  2  T  2 

Implying  that  N[0,  hence  that 

2  2  2 

~  X  (r^).  Also,  z  ~  N(0,Oj^^jI),  and,  consequently, 

^k+l^®k+l  ~  ^  ^’^k+l^* 


3.3  Canonical  representation 

Ue  now  consider  a  particular  choice  for  the  parameter  vectors 

T  T 

Tj . or,  equivalently,  for  the  coefficient  matrices  Aj,...,Aj^. 

This  choice  produces  representations  for  the  quadratic  forms  Qp...,Q|^ 
that  are,  as  we  discuss  In  Section  4,  informative  about  the  nature  of 
the  proposed  confidence  region,  and  that  can  be  useful  computationally 
Let  *  •  •  •  represent  the  nonzero,  and  hence  positive, 

characteristic  values  of  the  matrix  C^^,  define  -  dlag(Aj^^  , . . .  ) 

and  take  to  be  an  m^  x  r^  matrix  whose  columns  are  orthonormal 
characteristic  vectors  of  C.  corresponding  to  the  values  A,,,..., A,  , 
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respectively.  Thus,  by  definition. 


CiRi  -  RjDj,  *  I, 


Implying  that 


rJCiRi  -  D^,  RjC^R^  -  (C^R^)'^CjRj  -  D^R^RiD^  -  D^. 
Consider  the  choice 


T  _  -1/2  T  .  T 

Ai  Dj 

T  -1/2  T 

with  Note  that  this  choice  varies  with 

leads  to  the  representation 


.Yu* 


(3.4) 


Ql  =  x^(I.Y,D^)-  X,  = 


~  -1/2  T 

with  Dj  *^1^1‘ 


3.4  An  alternative  approach 


The  quadratic  forms  Qj.***.Qi^+j  defined,  and  their 

properties  established,  via  a  vector-space  approach,  as  we  now 

demonstrate.  Denote  by  r"  the  vector  space  consisting  of  all 

n-dlmenslonal  real  vectors,  and  let  VT*"  represent  the  orthogonal 

complement  of  a  subspace  W  of  r”  with  respect  to  the  usual  Inner  product, 

T 

that  Is,  the  Inner  product  that  assigns  the  value  7^X2 

n-dlmenslonal  real  vectors  y^  and  y^.  Further,  let  represent  the 

T 

orthogonal  complement  of  W  with  respect  to  the  Inner  product  y^Vjy2. 
Denote  by  C(A)  the  column  space  of  a  matrix  A. 


Confidence  Regions  For  Variance  Components 


10 


Define  *"** 

0i  -  C(X*_j)^  n  [C(X*)^]^i  (1-1 . k). 

Then,  dim(Uj^^j)  -  r^^^^  and 

dim  (U^)  -  dlmIC(X*_j)^l-dlmIC(X*)^l 

-  n-rank(X*_j)-tn-rank(X*)l  -  r^^  (i-1 . k). 

Take  to  be  an  n  x  r^  matrix  whose  columns  form  a  basis  for 

(i-1 , . . . ,k+l) .  Derivations  paralleling  those  of  results  (3«1),  (3.2), 

and  (3.3)  reveal  that  H^VHj  -  0  (j<i-l . k+1),  that  hJXq  -  0 

(i-l,...,k+l),  and  that  hJvh^  -  hJ(V^  +  ‘^1*1*1^’*!  and 

T  T 

^Sc+l'^^+l  “  *Ht+l\+l*  ^1***’*\+1  ®*’®  0’ft''Ogonal  with  respect 

T  i 

to  the  inner  product  y^^y2  their  direct  sum  Is  C(Xq)  • 

T  T 

We  have,  in  effect,  established  that  the  vectors  are 

distributed  independently,  with  ~  N(0,  Ofc+iHj(Vj  +  YjX£X£)Hj1 
(1-1,..., k)  and  H^^^y  ~  N(0,  follows  that  the  k+1 

quadratic  forms 

Q*  -  y\(Hj(Vj  +  YiX^xJ)H^l"^Hjy  (1-1 . k) 

*  T  T  —IT 

Vi  ■  >-  Vi<ViVi>  Vi’' 

*  2  2 

are  distributed  Independently,  with  ~  X  (r^)  (1-1 , . . . ,k+l) . 

These  quadratic  forms  are  invariant  to  the  choice  of  the  basis  matrices 
H, . ^4.1  >  ^a  is  easily  verified. 
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T  T 

It  Is  easy  to  show  that  one  choice  for  the  matrices  Is 

-  ^l^i'^1  ^ ^ ^"**1- 1  ^  (1*1, ■  F^.  This  result  can.  In  turn, 

*  * 

be  used  to  show  that  the  quadratic  forms  Identical  to  the 

quadratic  forms  >  Introduced  In  Section  3.1.  The 

representations  Q^>***>Qy^^y  Informative  about  the  nature  of  the 
computations  required  to  evaluate  the  k+1  quadratic  forms  for  specified 
values  of  Y2»***»Tjj* 


4.  NATURE  OF  CONFIDENCE  REGION 
4.1  General  case 

2  2 

The  set  S(y),  consisting  of  those  values  of  the  vector  a  (a  e  Q) 
that  simultaneously  satisfy  the  k-fl  Inequalities 

’^l-a  r  ^  ^l^®k+l  ^  r 

^12*  1  11’  1 

2 

Is  a  100(l-a)Z  confidence  region  for  a  .  We  now  present  an  alternative 
description  of  this  set,  one  which  provides  more  Insight  Into  the  nature 

of  the  set  and  which  Is  more  useful  computationally. 

* 

Let  represent  the  maximum  characteristic  value  of  the  matrix 
T  -1  * 

^I'^l  ^1’  define  -  max(A^j , . . .  ,A  )  (!>!,..., k).  It  Is  easy  to 

*  * 

show  that  <  Xj^.  For  any  fixed  value  of  » •  •  •  the  matrix 

T 

"  '^1  "'^1*1*1  positive  definite  for  those  values  of  belonging 

* 

to  the  Interval  -l/X^^  ^  ^1  ^  quadratic  form  Is  a  well- 

,  * 

defined  function  of  Yj  over  the  more  extensive  Interval  -l/A^  <  Y^^  <  “»  «8 
Is  evident  from  representation  (3.4).  Further,  as  a  function  of  y^.  Is 


S  •»  *.  *• 

-.'V  -  • "  ^ 


•  '  ^  -  *  •  ''i 


•  •  ‘ 
»  »  «  •  «  V 

•  !*,*  •-*  O 

jm  “_m 


•«  •.  •  "W  • 


_«*•  *  •  •  ^  »  • 

1^..“ -A.*  • ‘Vr 


•• 

N  •  .  -V  -.'-' 


rKrb 


-*****ji**’.  •* 
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strictly  decreasing  and  strictly  convex  over  the  interval  -1/Aj  <  <  “f 

and 

limit  Q.  ■  limit  Q.  ■  0 

Ti-*-  -IMJ  Y^-^  • 

(Harvllle  and  Fenech,  1984,  Section  3)*  For  convenience,  define 


Ql(-lMi,  Yj^+i . Y,j;y)  -  -  and  •,Yj^;y)  -  0 

* 

^k+r^a 


■  Vi'». . t. . .  Vi  •  Vi‘»>  ■ 


■k+l,l**^fcfl 


Further,  for  any  fixed  value  of  (y 
2 


1+1 . 'k 


•yj  e  r, 


and 


^  ^  k+1,2’  k+1 

and  for  any  fixed  positive  value  of  define  ■  *1^^1+1 '*  *  * ’^k* 

2  2  * 
fo  fhe  unique  value  of  y,  that  satisfies  Q.  ■  o,  .,y 
k+l’^  ^  '1  ^1  k+l*a  ,r- 

2  X 1  X 

Uj^  ■  “l^^l+l  *  *  *  *  *Xij»®k+l  ***  unique  value  of  Yj^  that  satisfies 

2  * 

Q.  “  ».  •  Then,  an  alternative  description  of  the  100(l-a)X 

1  lt+  A  ^^12 

confidence  region  S(y)  Is  S(y)  > 


{a  :  a  e  Q,  ^  ®1  ^  ®k+l“l  (^“1  * •  •  •  .h)} . 


4.2  Special  case 

The  upper  and  lower  bounds  u^  and  on  y^  were  defined  as  solutions 

to  equations  In  Y^whlch  are.  In  general.  Inherently  nonlinear  and  not 

amenable  to  explicit  solution.  We  now  consider  a  special  case  where  these 

equations  can  be  solved  explicitly. 

Let  represent  the  value  of  the  matrix  when  Yj^^j  ■  •••  ■  Yj^ 

(0)  *  *T  *  -  * 

■  0;  that  Is,  Pj^_j  ■  *i-i^^l-l^l-l^  Xj_j(l*l , . . . ,k+l) .  Note  that  the 
T  (0) 

quadratic  form  y  (1~Pq  )y  represents  the  residual  sum  of  squares  obtained 
from  a  least  squares  fit  of  the  submodel  y  > 
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Following  Brown  (1984),  we  define  an  analysis  of  variance  for  the 

T  (0)  T  T 

variance  components  to  be  a  partitioning  y  (I“Fq  )y  ■  y  A^y  +  ...  +  y  A^y, 

where  Ap...,A^  are  n  x  n  symmetric  matrices  of  known  constants,  such  that 

T  T  T  2 

(1)  y  Ajy . .  A^y  are  distributed  Independently,  (11)  y  ~ 

2 

X  (f^)  for  a  positive  Integer  f^  and  a  scalar  c^  (1-1 . s) ,  and  (111) 

the  scalars  Cp...,c^,  which  are  necessarily  linear  functions  of  . . 

are  distinct.  Brown  showed.  In  effect,  that  an  analysis  of  variance  for 

the  variance  components  exists  If  and  only  If  the  matrices 

(I-Pq®^)XjXj(I-Pq°^ ),..., (I-Pq®^)Xj^x][(I-Pq®^)  commute  In  pairs.  In  which 

T  T 

case  the  sums  of  squares  y  A^y,...,y  A^y  are  unique  up  to  order. 

In  the  Appendix,  we  show,  by  construction,  that.  If  an  analysis  of 
variance  for  the  variance  components  exists,  then 


"l  ■ 


(1-1 . k+1). 


where  1^...,!^^^^  represents  a  partitioning  of  the  Integers  l,...,s  Into 
k-t-l  sets.  We  show  further  that.  If  s  -  k4-l,  then,  for  1-1,..., k, 

Q - - 

where  *  r~^tr[Xj(P^®^-  (j-l,...,k). 

We  see  that.  In  the  special  case  where  there  exists  an  analysis  of 
variance  for  the  variance  components  and  where.  In  addition,  s  -  k+l, 

(4.1)  «,  -  -  '■  * 


-V.V.  W".  a*.  1^.  *. 


IvXiv'.-'lv.'V-! 
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(4.2)  »1  -  -  (1  * 

and  that,  for  balanced  classlflcatory  models,  our  procedure  for  forming  a 

confidence  region  reduces  to  the  traditional  procedure.  Note  that.  In  this 

2 

special  case,  and  u^  are  linear  functions  of  *nd  or, 

2  2  2  2  2 
equivalently,  ‘*k+l'*l  linear  functions  of  , . . .  ,Oj^, 

Even  If  an  analysis  of  variance  for  all  k+l  variance  components  does 

T  (0) 

not  exist,  there  may  exist  a  less  extensive  partitioning  y  )y  •• 

T  T  T  T 

yA,  y  +  ...'t-yAy  such  that  y  A,  . . .  A  y  are  distributed  Independently 

as  distinct  scalar  multiples  of  chi-square  random  variables.  In  which  case 

formulas  (4.1)  and  (4.2)  are  still  applicable,  at  least  for  1  -  k'-fl,...,k. 


5.  DISCUSSION 

Some  modifications  of  the  proposed  procedure  for  obtaining  a 

2  2 

confidence  region  for  the  variance  components  Op...,o^^^  and  some 
considerations  In  Its  Implementation  are  as  follows: 

1.  The  proposed  procedure  can  be  modified  so  as  to  obtain  a 

confidence  region  for  the  variance  ratios  Define  ■ 

2  2  2  2 

. where  represent  Independently 

distributed  chi-square  random  variables  with  degrees  of  freedom  rp...,r^^^ 

respectively.  Take  F^'^\  F^'*^  (1-1,..., k)  to  be  any  constants  such  that 

prlF^^^  <  Fj  <  (1-1,..., k)l  -  1-a. 

Then,  a  100(l-a)X  confidence  region  for  Yjf«>Y|^  is  fhe  set 

s*(y)  -  {y  !  (Y4.***.Yu)^  t  r,  ,,  x*  <  y.  <  (i-i,...,k)} 
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*  * 

where  .  •  •  •  .Yy^;y)  1*  the  unique  value  of  Yj^  that  satisfies 

"  ^1**^  **“*  “i  “  “l^^i+1 . unique  value 

of  Yj  that  satisfies  In  the  special  case  of  a 

balanced  classlflcetory  model  and  for  ...  -  -  0,  the  confidence 

* 

region  S  (y)  slm|>llflea  to  that  discussed  by  Sahal  and  Anderson  (1973). 

If  k  is  sufficiently  large,  the  determination  of  the  constants  F^'^\ 
F^®^  (l~l,...,k)  may  be  computationally  unfeasible.  In  which  case  our 
procedure  for  obtaining  an  exact  100(1 -a)Z  confidence  region  can  not  be 
implemented.  Homver,  an  approximate  100(l-a)Z  confidence  region  can  be 
obtained  by  replacing  F^*^  and  F^'*^  by  the  upper-(  1-0^2^  upper-o^^j 
points,  respectively,  of  the  marginal  distribution  of  F^.  It  follows  from 
Kimball's  (1951)  inequality  that,  for  ■  0,  this  region  Is 

conservative;  that  is.  Its  probability  of  coverage  equals  or  exceeds  1-a. 
In  the  special  case  of  a  balanced  classlflcatory  model  and  for  >  ...  > 
~  0,  the  approximate  confidence  region  simplifies  to  that  proposed  by 
Broemellng  (1969). 

2.  The  proposed  procedure  for  using  the  quantities  Qj  *  •  •  • 

2  2 

form  a  confidence  region  for  the  variance  components  or  the 

variance  ratios  Yp**«>Yf^  can  be  modified.  In  an  obvious  way,  to  obtain  a 

*  2  2 

confidence  region  for  the  last  k-k  +2  variance  components  or 

* 

the  last  k-k  +1  variance  ratios  Yv*i***>Yk»  based  on  the  quantities 
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3.  By  following  the  approach  described,  for  example,  by  Spjotvoll 

(1972)  and  Khurl  (1981),  the  proposed  confidence  region  for  the  variance 
2  2 

components  o^ . ^k-t-l  variance  ratios  can  be  transformed  Into 

a  generally  conservative  confidence  region  for  a  function  of  the  variance 
components  or  variance  ratios  or,  more  generally,  for  a  family  of  such 
functions.  For  Instance,  let  R£(y)  represent  the  range  of  a  vector  f  > 
f(a^ , • • . ,o^^^)  of  functions  of  variance  components  when  the  domain  of  f 
Is  restricted  to  the  set  S(y).  Then,  clearly,  pr[feR^(y)]  >  1-a;  that  Is, 
R^(y)  Is  a  generally  conservative  100( 1*^)2  confidence  region  for  f. 

4.  Often,  the  parameter  space  for  the  vector  Is  a  proper  subset 

Q*  of  the  set  Q,  rather  than  Q  Itself.  In  particular,  the  parameter  space 
for  may  be  the  set  Qq. 

The  confidence  region  S(y)  may  Include  values  of  cr^  not  belonging  to 
Q*.  Let  S'(y)  represent  the  subset  of  S(y)  obtained  by  deleting  all  such 
values.  For  c  O',  prlo^  c  S'(y)]  ■  prlo^  c  S(y)]  ■  l-o.  Thus,  when  the 
parameter  space  for  Is  Q',  S'(y),  like  S(y),  Is  a  100(l-a)Z  confidence 
region  for  . 

In  the  special  case  O'  >  Qqi  the  100(l-a)Z  confidence  region  S'(y) 

Is  obtained  from  S(y)  by  deleting  all  values  of  for  which  one  or  more 

2  2 

of  the  first  k  variance  components  o^,...,0|^  Is  negative.  Expanding  the 
set  S'(y)  slightly,  we  obtain  the  set  S^(y)  ■ 

0^+1  >  0,  max(l^,0)<o^<o^j  max(Uj^,0)  (1-1 . k) 

We  have  that  pr(o  e  S  (y)]  equals  1-a,  if  >  0  for  and  is 
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greater  than  or  equal  to  1-a,  otherwise,  as  Is  easily  verified.  Thus, 


when  the  parameter  space  for  Is  Qg,  S^(y)  Is  a  conservative  100(l-a)Z 


confidence  region.  Unlike  S'(y),  the  set  S‘*'(y)  Is  nonempty,  with 


probability  one. 


5.  There  will  generally  be  more  than  one  way  to  express  a 


particular  mixed  or  random  linear  model  as  a  special  case  of  the  general 


mixed  linear  model  (1.1).  Consider,  for  example,  the  customary  additive 


model  for  a  two-way  crossed  classification  with  both  factors  regarded  as 


random.  Ue  can  take  the  elements  of  b^  to  be  the  effects  of  the  first 


factor  or,  alternatively,  the  effects  of  the  second  factor. 


It  should  be  noted  that,  except  for  highly  structured  situations. 


like  those  considered  In  Section  4.2,  the  confidence  region  S(y)  will 


vary  with  the  order  In  which  we  assign  the  various  sets  of  random  effects 


to  the  vectors  b|,...,b|^. 


6.  Note  that,  by  exploiting  general  relationships  between 


confidence  regions  and  tests  of  hypothesis  or  significance,  the  proposed 


procedure  for  forming  a  confidence  region  can  be  used  to  test,  against 


appropriate  alternatives,  a  null  hypothesis  of  the  general  form 


"o’  “l  ■  'l . "mi  ■  Vl  "  "o’  ”1  ■  'l . ’k  ■  ■t.-  'l . Vl 


represent  specified  constants. 


Also,  following  Harvllle  and  Fenech  (1984,  Section  7),  we  can 


obtain  point  estimators  of  the  variance  components  or  variance  ratios  by 


2  2 

equating  the  pivotal  quantities  »•  •  •  or  * 


. . .  ,(rj^^j/rj^)Qj^/Qj^^j  to  appropriately  chosen  constants.  In 


mssi 


K 


fS-i-fc-i' 


.<  -• 


•  ■  *  -  A  . 


•.'.■•./.••■/A 


aV-V-'.-- 


-V- s.‘- 1  •  .'i 


O  V'.  V  v' V 


-  >>V- 


.*•  •  •  W**  ►'  *  «’•  »*•  1,' 


I 
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2  2 

particular,  If  we  equate  ^o  r^ . 

respectively,  we  obtain  estimators  which.  In  the  special  case  where 
there  exists  an  analysis  of  variance  for  the  variance  components  and 
where  s  ••  k+1,  simplify  to  the  usual  analysls-of-varlance  estimators. 
Alternatively,  If  we  equate  these  quantities  to  the  medians  of  their 
respective  chi-square  distributions,  we  obtain  estimators  that  can 
be  Interpreted  as  the  coordinates  of  a  degenerate,  single-point 
confidence  region. 

(t) 

7.  Let  Q  -  Q,  +  ...  +  Q,  and  r  -r,  +  ...  +  r,  ,  and  define  F  and 
x,  •  1  k’ 

F^**^  to  be  any  constants  such  that  pr[F^^^  <  F  <  F^'^^1  *  1-a,  where  F 

represents  an  F  random  variable  with  degrees  of  freedom  r^  and 

2  2 

respectively.  Clearly,  the  distribution  of  Is  X  (*■,)»  s”**  Q, 

are  distributed  Independently.  It  follows  that  an  exact  100(l-a)Z 

confidence  region  for  . .  alternative  to  the  confidence  region 

*  # 

S  (y).  Is  the  set  S  (y)  consisting  of  those  values  of  y  that  satisfy  the 
Inequality 

(t) 

It  can  be  shown  that.  In  the  special  case  where  F  <■  0  (and  hence 
where  F^**^  Is  the  upper-a  point  of  the  distribution  of  F),  the  confidence 
region  S^(y)  Is  the  same  as  the  exact  confidence  region  devised  by  Hartley 
and  Rao  (1967,  Sec.  9).  We  do  not  recommend  the  confidence  region  S^(y) 
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since,  as  previously  Indicated  In  the  apecial  case  of  the  Hartley-Rao 

region.  It  has  a  seemingly  unappealing  form  and  can,  with  high 

probability,  produce  confidence  regions  of  infinite  volume. 

8.  In  practice,  we  may  want  to  display  graphically  the  confidence 

region  S(y),  or  when  k  exceeds  one  or  two,  to  display,  for  each  of  various 

2  2  2 

fixed  values  of  the  vector  and  the  Integer  1,  the 

2 

Interval  of  values,  or  perhaps  the  two-  or  three-dimensional  set  of 
2  2  2  2  2 

Oj_^,  or  values,  that  are  represented  In  S(y).  We  are 

then  faced  with  the  computational  problem  of  determining  the  numerical 

2  2  2 

values  of  and  u^  corresponding  to  each  value  of  >  *  *  * 

and  each  value  of  1. 

In  the  special  case  where  there  exists  an  analysis  of  variance 

for  the  variance  components  and  where  s  >  k-fl,  this  problem  reduces. 

In  effect,  to  that  of  computing  the  entries  in  the  analysis-of-varlance 

table.  Hore  generally,  we  can  use  the  approach  discussed.  In  the 

special  case  1  -  k,  by  Harvllle  and  Fenech  (1984,  Section  4),  to  compute 

the  values  of  A,, . A.  and  of  If. ,,...,7,  and  to  then  determine  the 

11’  ’  ir^^  11’  ’  ir^ 

values  of  and  u^  graphically  or  Iteratively.  How  a  series  of  such 

2  2  2 

determinations.  Involving  various  values  of  the  vector  » •  •  • 

and/or  the  Integer  1,  might  be  accomplished  most  efflcently  Is  a  question 
for  possible  Investigation. 


We  now  derive,  for  the  special  case  where  there  exists  an 
analysis  of  variance  for  the  variance  components,  the  simplified 

representations  given  In  Section  4.2  for  the  qtiadratlc  forms 

(0)  (0)  *  1  *  to) 

Let  -  C(Xq),  ^C(X^)  (1-1 . k),  and  u;;“j 

CCX^)"*",  so  that,  for  l-l,...,k+l,  0^®^  represents  when 

-  0.  Define  to  be  an  n  x  r^  matrix  whose  columns  form  an 

orthonormal  basis  for  with  respect  to  the  ordinary  Inner  product 

it 

Note  that  the  columns  of  the  matrix  form  an 

* 

orthonormal  basis  for  C(X^)  (l-0,...,k)  and  that  the  columns  of  the 
matrix  M  -  form  an  orthonormal  basis  for  C(Xq)‘*’.  Note 

also  that,  for  1-0,.. •,k, 

*  *T  *  *T  *  _l  *T  *  *T  *  -  *T  fO) 

“l”l  "  M^)  \  -  X^(X^*Xj) 

and  that,  for  1-1,..., k, 

T  *  *X  *  *x  (0)  (0) 

Vi  -  ”i«i  -  ”i-i”i-i  -  ^1  -  "i-r 

xJmimJxi  -  x[(p[°^-p[°j)x^  -  x[(l-p[°j)x^, 

(A.l)  rank  (mJx^xJm^)  -  rank  (X^M^M^X^)  -  rank  lxJ(I-p[°j)X^]  -  r^. 
Suppose  now  that  there  exists  an  analysis  of  variance  for  the 
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T  T 

variance  components.  Then,  the  k  matrices  N  X^X^M  - 
dlag[(Mj,...,M^)^  X^X^(M^,...,M^),  0]  (l-l,*..,k)  commute  In  pairs 
(Brown,  1984,  Corollary  2),  which  Implies  that,  for  1-1 . . 

(A.2)  mJx^xJm  -  dlag(MjXjXjMj , . . . ,MjX^xJm^,  0, ... ,0) , 

as  we  now  show. 

Our  proof  of  result  (A.2)  Is  by  Induction.  The  result  Is  clearly 
valid  for  1-1.  Suppose  that  It  Is  valid  for  1-1 , . . . ,1'-1 .  Then,  since 
n'^XiX^M  and  H^X^,X^, M  commute.  Implying  that  the  IJth  block  of  the 
matrix  M^X^xJmM^X^,X^,M  Is  the  same  as  that  of  M^X^.X^.MM^X^X^M, 

we  have,  for  1-1,...,1'-1  and  j-1+1 , . . . ,1* ,  that 


T  T 


and  hence.  In  light  of  result  (A.l),  that  M^X^,X^,Mj  -  0.  Observing  that 
T  T 

M  X^X^M  Is  symmetric,  we  conclude  that  result  (A.2)  Is  valid  for  1-1* , 
which  completes  the  proof  of  this  result. 

Now,  since  the  matrices  m'^X^xJm,...,M^Xj^x|^M  commute  In  pairs,  we 
have.  In  light  of  result  (A.2),  that  the  matrices  M^X^^X , . . .  ,MjX^X^Mj^ 
commute  In  pairs.  Consequently,  there  exists  an  orthogonal  matrix  that 
simultaneously  diagonalizes  the  matrices  N^X^X^H^, .. . ,M^X^X^H^;  that  Is, 
there  exists  an  orthogonal  matrix  0^  such  that 

oJmJXjXjM^O^  -  dlag(Xj^j,...,X^^^^) 


for  some  nonnegative  real  numbers  \ 


(J"l»...»k) 
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As  a  further  consequence  of  result  (A*2),  we  have  that 


. Vl)  -  I'.VlX"!*! . Vl>  ■  “• 


Implying  that  the  columns  of  and  hence  those  of  form  a  basis 


for  Ui  (1-1,..., k). 

For  1-1,..., k,  define 


T  T 

”  (w^p...,w^^  )  -  y. 


define  » •  •  •  distinct  linear  functions  represented 

1  V  k 


among  the  r. linear  functions  1+E,  .X..,y, . l+£.  v. ,  and,  for 

1  j-1  Jll  j  J“1  jlr  'J* 

*  ^ 


m-l,...,r. ,  take  T,  to  be  the  collection  of  values  of  the  Index  t  for 
1  m 


which  1+Z,  -X..  V,  -  c.  ,  take  f.  to  be  the  dimension  of  this 
j-1  jit  j  im  Ira 


collection,  and  let  W.  -  _  w^^. 

itn  tCT .  1C 
Im 


It  follows  from  the  results  of  Section  3.4  that 


T  T  T 

,  *  *a  *  %r  %r  * 


v-1 


Qi  -  Ql  -  y 


E  ,  W,  /c, 
m»l  im  Iro 


(1-1, ...,k). 


Vi  -  Cl  -  yVi<Ci\^i>‘'Ciy  “  yX+iCi^ 


Further  we  find  that 

* 


,k  „’^1 


and,  proceeding  as  In  Section  3.4,  that  W.  (1-1,..., k;  m-l,...,r.), 

im  1 


2  2 

are  distributed  Independently  and  ~  X 


Implying  that  W.  (l-l,...,k;  m-l,...,r.),  Q.  ,  are  the  sums  of  squares 
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In  the  analysis  of  variance  for  the  variance  components. 

Now,  suppose  that  the  analysis  of  variance  for  the  variance 
components  Is  such  that  s  -  k+1.  Then,  for  l«l,...,k. 


for  some  real  number  given  by 
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